Overview
Brought to you by YData
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 6362620 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 GiB |
| Average record size in memory | 263.4 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 3 |
| Text | 2 |
amount is highly overall correlated with newbalanceDest and 1 other fields | High correlation |
newbalanceDest is highly overall correlated with amount and 1 other fields | High correlation |
newbalanceOrig is highly overall correlated with oldbalanceOrg | High correlation |
oldbalanceDest is highly overall correlated with amount and 1 other fields | High correlation |
oldbalanceOrg is highly overall correlated with newbalanceOrig | High correlation |
isFraud is highly imbalanced (98.6%) | Imbalance |
isFlaggedFraud is highly imbalanced (> 99.9%) | Imbalance |
amount is highly skewed (γ1 = 30.99394948) | Skewed |
oldbalanceOrg has 2102449 (33.0%) zeros | Zeros |
newbalanceOrig has 3609566 (56.7%) zeros | Zeros |
oldbalanceDest has 2704388 (42.5%) zeros | Zeros |
newbalanceDest has 2439433 (38.3%) zeros | Zeros |
Reproduction
| Analysis started | 2025-04-17 05:14:22.401513 |
|---|---|
| Analysis finished | 2025-04-17 05:19:36.206538 |
| Duration | 5 minutes and 13.81 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
step
Real number (ℝ)
| Distinct | 743 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 243.39725 |
| Minimum | 1 |
|---|---|
| Maximum | 743 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 16 |
| Q1 | 156 |
| median | 239 |
| Q3 | 335 |
| 95-th percentile | 490 |
| Maximum | 743 |
| Range | 742 |
| Interquartile range (IQR) | 179 |
Descriptive statistics
| Standard deviation | 142.33197 |
|---|---|
| Coefficient of variation (CV) | 0.58477232 |
| Kurtosis | 0.32907056 |
| Mean | 243.39725 |
| Median Absolute Deviation (MAD) | 92 |
| Skewness | 0.37517689 |
| Sum | 1.5486442 × 109 |
| Variance | 20258.39 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 19 | 51352 | 0.8% |
| 18 | 49579 | 0.8% |
| 187 | 49083 | 0.8% |
| 235 | 47491 | 0.7% |
| 307 | 46968 | 0.7% |
| 163 | 46352 | 0.7% |
| 139 | 46054 | 0.7% |
| 403 | 45155 | 0.7% |
| 43 | 45060 | 0.7% |
| 355 | 44787 | 0.7% |
| Other values (733) | 5890739 |
| Value | Count | Frequency (%) |
| 1 | 2708 | < 0.1% |
| 2 | 1014 | < 0.1% |
| 3 | 552 | < 0.1% |
| 4 | 565 | < 0.1% |
| 5 | 665 | < 0.1% |
| 6 | 1660 | < 0.1% |
| 7 | 6837 | 0.1% |
| 8 | 21097 | |
| 9 | 37628 | |
| 10 | 35991 |
| Value | Count | Frequency (%) |
| 743 | 8 | < 0.1% |
| 742 | 14 | |
| 741 | 22 | |
| 740 | 6 | < 0.1% |
| 739 | 10 | |
| 738 | 10 | |
| 737 | 10 | |
| 736 | 14 | |
| 735 | 12 | |
| 734 | 8 | < 0.1% |
type
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 390.9 MiB |
| CASH_OUT | |
|---|---|
| PAYMENT | |
| CASH_IN | |
| TRANSFER | |
| DEBIT | 41432 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.422396 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAYMENT |
|---|---|
| 2nd row | PAYMENT |
| 3rd row | TRANSFER |
| 4th row | CASH_OUT |
| 5th row | PAYMENT |
Common Values
| Value | Count | Frequency (%) |
| CASH_OUT | 2237500 | |
| PAYMENT | 2151495 | |
| CASH_IN | 1399284 | |
| TRANSFER | 532909 | 8.4% |
| DEBIT | 41432 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cash_out | 2237500 | |
| payment | 2151495 | |
| cash_in | 1399284 | |
| transfer | 532909 | 8.4% |
| debit | 41432 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 6321188 | |
| T | 4963336 | |
| S | 4169693 | |
| N | 4083688 | |
| C | 3636784 | 7.7% |
| _ | 3636784 | 7.7% |
| H | 3636784 | 7.7% |
| E | 2725836 | 5.8% |
| O | 2237500 | 4.7% |
| U | 2237500 | 4.7% |
| Other values (8) | 9576792 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 43589101 | |
| Connector Punctuation | 3636784 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6321188 | |
| T | 4963336 | |
| S | 4169693 | |
| N | 4083688 | |
| C | 3636784 | |
| H | 3636784 | |
| E | 2725836 | 6.3% |
| O | 2237500 | 5.1% |
| U | 2237500 | 5.1% |
| Y | 2151495 | 4.9% |
| Other values (7) | 7425297 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3636784 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43589101 | |
| Common | 3636784 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6321188 | |
| T | 4963336 | |
| S | 4169693 | |
| N | 4083688 | |
| C | 3636784 | |
| H | 3636784 | |
| E | 2725836 | 6.3% |
| O | 2237500 | 5.1% |
| U | 2237500 | 5.1% |
| Y | 2151495 | 4.9% |
| Other values (7) | 7425297 |
Common
| Value | Count | Frequency (%) |
| _ | 3636784 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47225885 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 6321188 | |
| T | 4963336 | |
| S | 4169693 | |
| N | 4083688 | |
| C | 3636784 | 7.7% |
| _ | 3636784 | 7.7% |
| H | 3636784 | 7.7% |
| E | 2725836 | 5.8% |
| O | 2237500 | 4.7% |
| U | 2237500 | 4.7% |
| Other values (8) | 9576792 |
amount
Real number (ℝ)
High correlation  Skewed 
| Distinct | 5316900 |
|---|---|
| Distinct (%) | 83.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179861.9 |
| Minimum | 0 |
|---|---|
| Maximum | 92445517 |
| Zeros | 16 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2224.0995 |
| Q1 | 13389.57 |
| median | 74871.94 |
| Q3 | 208721.48 |
| 95-th percentile | 518634.2 |
| Maximum | 92445517 |
| Range | 92445517 |
| Interquartile range (IQR) | 195331.91 |
Descriptive statistics
| Standard deviation | 603858.23 |
|---|---|
| Coefficient of variation (CV) | 3.3573437 |
| Kurtosis | 1797.9567 |
| Mean | 179861.9 |
| Median Absolute Deviation (MAD) | 68393.655 |
| Skewness | 30.993949 |
| Sum | 1.1443929 × 1012 |
| Variance | 3.6464476 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10000000 | 3207 | 0.1% |
| 10000 | 88 | < 0.1% |
| 5000 | 79 | < 0.1% |
| 15000 | 68 | < 0.1% |
| 500 | 65 | < 0.1% |
| 100000 | 42 | < 0.1% |
| 21500 | 37 | < 0.1% |
| 120000 | 29 | < 0.1% |
| 135000 | 20 | < 0.1% |
| 0 | 16 | < 0.1% |
| Other values (5316890) | 6358969 |
| Value | Count | Frequency (%) |
| 0 | 16 | |
| 0.01 | 1 | < 0.1% |
| 0.02 | 3 | < 0.1% |
| 0.03 | 2 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.06 | 1 | < 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.09 | 1 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 92445516.64 | 1 | |
| 73823490.36 | 1 | |
| 71172480.42 | 1 | |
| 69886731.3 | 1 | |
| 69337316.27 | 1 | |
| 67500761.29 | 1 | |
| 66761272.21 | 1 | |
| 64234448.19 | 1 | |
| 63847992.58 | 1 | |
| 63294839.63 | 1 |
nameOrig
Text
| Distinct | 6353307 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 409.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.482323 |
| Min length | 5 |
Unique
| Unique | 6344009 ? |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | C1231006815 |
|---|---|
| 2nd row | C1666544295 |
| 3rd row | C1305486145 |
| 4th row | C840083671 |
| 5th row | C2048537720 |
| Value | Count | Frequency (%) |
| c2098525306 | 3 | < 0.1% |
| c1999539787 | 3 | < 0.1% |
| c1065307291 | 3 | < 0.1% |
| c1462946854 | 3 | < 0.1% |
| c1677795071 | 3 | < 0.1% |
| c1784010646 | 3 | < 0.1% |
| c724452879 | 3 | < 0.1% |
| c545315117 | 3 | < 0.1% |
| c1530544995 | 3 | < 0.1% |
| c400299098 | 3 | < 0.1% |
| Other values (6353297) | 6362590 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8803448 | |
| C | 6362620 | |
| 2 | 6136135 | |
| 3 | 5699596 | |
| 4 | 5693146 | |
| 7 | 5669437 | |
| 5 | 5668010 | |
| 6 | 5667725 | |
| 0 | 5667074 | |
| 9 | 5665212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 60332420 | |
| Uppercase Letter | 6362620 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8803448 | |
| 2 | 6136135 | |
| 3 | 5699596 | |
| 4 | 5693146 | |
| 7 | 5669437 | |
| 5 | 5668010 | |
| 6 | 5667725 | |
| 0 | 5667074 | |
| 9 | 5665212 | |
| 8 | 5662637 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6362620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 60332420 | |
| Latin | 6362620 | 9.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8803448 | |
| 2 | 6136135 | |
| 3 | 5699596 | |
| 4 | 5693146 | |
| 7 | 5669437 | |
| 5 | 5668010 | |
| 6 | 5667725 | |
| 0 | 5667074 | |
| 9 | 5665212 | |
| 8 | 5662637 |
Latin
| Value | Count | Frequency (%) |
| C | 6362620 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66695040 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8803448 | |
| C | 6362620 | |
| 2 | 6136135 | |
| 3 | 5699596 | |
| 4 | 5693146 | |
| 7 | 5669437 | |
| 5 | 5668010 | |
| 6 | 5667725 | |
| 0 | 5667074 | |
| 9 | 5665212 |
oldbalanceOrg
Real number (ℝ)
High correlation  Zeros 
| Distinct | 1845844 |
|---|---|
| Distinct (%) | 29.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 833883.1 |
| Minimum | 0 |
|---|---|
| Maximum | 59585040 |
| Zeros | 2102449 |
| Zeros (%) | 33.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 14208 |
| Q3 | 107315.18 |
| 95-th percentile | 5823702.3 |
| Maximum | 59585040 |
| Range | 59585040 |
| Interquartile range (IQR) | 107315.18 |
Descriptive statistics
| Standard deviation | 2888242.7 |
|---|---|
| Coefficient of variation (CV) | 3.4636062 |
| Kurtosis | 32.964879 |
| Mean | 833883.1 |
| Median Absolute Deviation (MAD) | 14208 |
| Skewness | 5.2491364 |
| Sum | 5.3056813 × 1012 |
| Variance | 8.3419457 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2102449 | |
| 184 | 918 | < 0.1% |
| 133 | 914 | < 0.1% |
| 195 | 912 | < 0.1% |
| 164 | 909 | < 0.1% |
| 109 | 908 | < 0.1% |
| 181 | 908 | < 0.1% |
| 157 | 902 | < 0.1% |
| 146 | 899 | < 0.1% |
| 136 | 898 | < 0.1% |
| Other values (1845834) | 4252003 |
| Value | Count | Frequency (%) |
| 0 | 2102449 | |
| 0.05 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.44 | 1 | < 0.1% |
| 0.67 | 1 | < 0.1% |
| 1 | 370 | < 0.1% |
| 1.02 | 1 | < 0.1% |
| 1.37 | 1 | < 0.1% |
| 1.38 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 59585040.37 | 1 | |
| 57316255.05 | 1 | |
| 50399045.08 | 1 | |
| 49585040.37 | 1 | |
| 47316255.05 | 1 | |
| 45674547.89 | 1 | |
| 44892193.09 | 1 | |
| 43818855.3 | 1 | |
| 43686616.33 | 1 | |
| 42542664.27 | 1 |
newbalanceOrig
Real number (ℝ)
High correlation  Zeros 
| Distinct | 2682586 |
|---|---|
| Distinct (%) | 42.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 855113.67 |
| Minimum | 0 |
|---|---|
| Maximum | 49585040 |
| Zeros | 3609566 |
| Zeros (%) | 56.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 144258.41 |
| 95-th percentile | 5980262.3 |
| Maximum | 49585040 |
| Range | 49585040 |
| Interquartile range (IQR) | 144258.41 |
Descriptive statistics
| Standard deviation | 2924048.5 |
|---|---|
| Coefficient of variation (CV) | 3.4194852 |
| Kurtosis | 32.066985 |
| Mean | 855113.67 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.176884 |
| Sum | 5.4407633 × 1012 |
| Variance | 8.5500596 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3609566 | |
| 26099.09 | 4 | < 0.1% |
| 3684.32 | 4 | < 0.1% |
| 18672.58 | 4 | < 0.1% |
| 38767.21 | 4 | < 0.1% |
| 7717.83 | 4 | < 0.1% |
| 366.96 | 4 | < 0.1% |
| 10528.49 | 4 | < 0.1% |
| 8396.98 | 4 | < 0.1% |
| 785.66 | 4 | < 0.1% |
| Other values (2682576) | 2753018 |
| Value | Count | Frequency (%) |
| 0 | 3609566 | |
| 0.01 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.05 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.18 | 1 | < 0.1% |
| 0.21 | 1 | < 0.1% |
| 0.23 | 1 | < 0.1% |
| 0.3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 49585040.37 | 1 | |
| 47316255.05 | 1 | |
| 43686616.33 | 1 | |
| 43673802.21 | 1 | |
| 41690842.64 | 1 | |
| 41432359.46 | 1 | |
| 40399045.08 | 1 | |
| 39585040.37 | 1 | |
| 38946233.02 | 1 | |
| 38939424.03 | 1 |
nameDest
Text
| Distinct | 2722362 |
|---|---|
| Distinct (%) | 42.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 409.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.481752 |
| Min length | 2 |
Unique
| Unique | 2262704 ? |
|---|---|
| Unique (%) | 35.6% |
Sample
| 1st row | M1979787155 |
|---|---|
| 2nd row | M2044282225 |
| 3rd row | C553264065 |
| 4th row | C38997010 |
| 5th row | M1230701703 |
| Value | Count | Frequency (%) |
| c1286084959 | 113 | < 0.1% |
| c985934102 | 109 | < 0.1% |
| c665576141 | 105 | < 0.1% |
| c2083562754 | 102 | < 0.1% |
| c1590550415 | 101 | < 0.1% |
| c248609774 | 101 | < 0.1% |
| c451111351 | 99 | < 0.1% |
| c1789550256 | 99 | < 0.1% |
| c1360767589 | 98 | < 0.1% |
| c1023714065 | 97 | < 0.1% |
| Other values (2722352) | 6361596 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8799996 | |
| 2 | 6133780 | |
| 3 | 5704404 | |
| 4 | 5691070 | |
| 8 | 5675627 | |
| 9 | 5668861 | |
| 7 | 5665128 | |
| 0 | 5664751 | |
| 6 | 5662897 | |
| 5 | 5662271 | |
| Other values (2) | 6362620 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 60328785 | |
| Uppercase Letter | 6362620 | 9.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8799996 | |
| 2 | 6133780 | |
| 3 | 5704404 | |
| 4 | 5691070 | |
| 8 | 5675627 | |
| 9 | 5668861 | |
| 7 | 5665128 | |
| 0 | 5664751 | |
| 6 | 5662897 | |
| 5 | 5662271 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 4211125 | |
| M | 2151495 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 60328785 | |
| Latin | 6362620 | 9.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8799996 | |
| 2 | 6133780 | |
| 3 | 5704404 | |
| 4 | 5691070 | |
| 8 | 5675627 | |
| 9 | 5668861 | |
| 7 | 5665128 | |
| 0 | 5664751 | |
| 6 | 5662897 | |
| 5 | 5662271 |
Latin
| Value | Count | Frequency (%) |
| C | 4211125 | |
| M | 2151495 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 66691405 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8799996 | |
| 2 | 6133780 | |
| 3 | 5704404 | |
| 4 | 5691070 | |
| 8 | 5675627 | |
| 9 | 5668861 | |
| 7 | 5665128 | |
| 0 | 5664751 | |
| 6 | 5662897 | |
| 5 | 5662271 | |
| Other values (2) | 6362620 |
oldbalanceDest
Real number (ℝ)
High correlation  Zeros 
| Distinct | 3614697 |
|---|---|
| Distinct (%) | 56.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1100701.7 |
| Minimum | 0 |
|---|---|
| Maximum | 3.5601589 × 108 |
| Zeros | 2704388 |
| Zeros (%) | 42.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 132705.66 |
| Q3 | 943036.71 |
| 95-th percentile | 5147229.7 |
| Maximum | 3.5601589 × 108 |
| Range | 3.5601589 × 108 |
| Interquartile range (IQR) | 943036.71 |
Descriptive statistics
| Standard deviation | 3399180.1 |
|---|---|
| Coefficient of variation (CV) | 3.0881938 |
| Kurtosis | 948.67413 |
| Mean | 1100701.7 |
| Median Absolute Deviation (MAD) | 132705.66 |
| Skewness | 19.921758 |
| Sum | 7.0033464 × 1012 |
| Variance | 1.1554425 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2704388 | |
| 10000000 | 615 | < 0.1% |
| 20000000 | 219 | < 0.1% |
| 30000000 | 86 | < 0.1% |
| 40000000 | 31 | < 0.1% |
| 102 | 21 | < 0.1% |
| 198 | 19 | < 0.1% |
| 125 | 18 | < 0.1% |
| 160 | 18 | < 0.1% |
| 132 | 18 | < 0.1% |
| Other values (3614687) | 3657187 |
| Value | Count | Frequency (%) |
| 0 | 2704388 | |
| 0.01 | 1 | < 0.1% |
| 0.03 | 1 | < 0.1% |
| 0.13 | 1 | < 0.1% |
| 0.33 | 1 | < 0.1% |
| 0.37 | 1 | < 0.1% |
| 0.79 | 1 | < 0.1% |
| 1 | 7 | < 0.1% |
| 1.39 | 1 | < 0.1% |
| 1.64 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 356015889.4 | 1 | |
| 355553416.3 | 1 | |
| 355381433.6 | 1 | |
| 355380483.5 | 1 | |
| 355185537.1 | 1 | |
| 328194464.9 | 1 | |
| 327998074.2 | 1 | |
| 327963024 | 1 | |
| 327852121.4 | 1 | |
| 327827763.4 | 1 |
newbalanceDest
Real number (ℝ)
High correlation  Zeros 
| Distinct | 3555499 |
|---|---|
| Distinct (%) | 55.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1224996.4 |
| Minimum | 0 |
|---|---|
| Maximum | 3.5617928 × 108 |
| Zeros | 2439433 |
| Zeros (%) | 38.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 48.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 214661.44 |
| Q3 | 1111909.2 |
| 95-th percentile | 5515715.9 |
| Maximum | 3.5617928 × 108 |
| Range | 3.5617928 × 108 |
| Interquartile range (IQR) | 1111909.2 |
Descriptive statistics
| Standard deviation | 3674128.9 |
|---|---|
| Coefficient of variation (CV) | 2.9992978 |
| Kurtosis | 862.15651 |
| Mean | 1224996.4 |
| Median Absolute Deviation (MAD) | 214661.44 |
| Skewness | 19.352302 |
| Sum | 7.7941866 × 1012 |
| Variance | 1.3499223 × 1013 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2439433 | |
| 10000000 | 53 | < 0.1% |
| 971418.91 | 32 | < 0.1% |
| 19169204.93 | 29 | < 0.1% |
| 16532032.16 | 25 | < 0.1% |
| 1254956.07 | 25 | < 0.1% |
| 1412484.09 | 22 | < 0.1% |
| 1178808.14 | 21 | < 0.1% |
| 7364724.84 | 21 | < 0.1% |
| 4743010.67 | 21 | < 0.1% |
| Other values (3555489) | 3922938 |
| Value | Count | Frequency (%) |
| 0 | 2439433 | |
| 0.01 | 1 | < 0.1% |
| 0.33 | 1 | < 0.1% |
| 1.39 | 1 | < 0.1% |
| 1.64 | 1 | < 0.1% |
| 1.74 | 1 | < 0.1% |
| 2.15 | 1 | < 0.1% |
| 2.45 | 1 | < 0.1% |
| 2.71 | 1 | < 0.1% |
| 2.76 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 356179278.9 | 1 | |
| 356015889.4 | 1 | |
| 355553416.3 | 2 | |
| 355381433.6 | 1 | |
| 355380483.5 | 1 | |
| 355185537.1 | 1 | |
| 328431698.2 | 1 | |
| 328194464.9 | 1 | |
| 327998074.2 | 1 | |
| 327963024 | 1 |
isFraud
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 351.9 MiB |
| 0 | |
|---|---|
| 1 | 8213 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6362620 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6362620 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6362620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6354407 | |
| 1 | 8213 | 0.1% |
isFlaggedFraud
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 351.9 MiB |
| 0 | |
|---|---|
| 1 | 16 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6362620 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6362620 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6362620 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6362604 | |
| 1 | 16 | < 0.1% |
Interactions
Correlations
| amount | isFlaggedFraud | isFraud | newbalanceDest | newbalanceOrig | oldbalanceDest | oldbalanceOrg | step | type | |
|---|---|---|---|---|---|---|---|---|---|
| amount | 1.000 | 0.014 | 0.049 | 0.670 | -0.071 | 0.595 | 0.048 | 0.001 | 0.050 |
| isFlaggedFraud | 0.014 | 1.000 | 0.043 | 0.000 | 0.005 | 0.000 | 0.003 | 0.006 | 0.005 |
| isFraud | 0.049 | 0.043 | 1.000 | 0.002 | 0.019 | 0.002 | 0.031 | 0.059 | 0.059 |
| newbalanceDest | 0.670 | 0.000 | 0.002 | 1.000 | -0.094 | 0.936 | -0.008 | -0.005 | 0.027 |
| newbalanceOrig | -0.071 | 0.005 | 0.019 | -0.094 | 1.000 | 0.044 | 0.803 | -0.011 | 0.238 |
| oldbalanceDest | 0.595 | 0.000 | 0.002 | 0.936 | 0.044 | 1.000 | 0.024 | -0.005 | 0.017 |
| oldbalanceOrg | 0.048 | 0.003 | 0.031 | -0.008 | 0.803 | 0.024 | 1.000 | -0.006 | 0.213 |
| step | 0.001 | 0.006 | 0.059 | -0.005 | -0.011 | -0.005 | -0.006 | 1.000 | 0.011 |
| type | 0.050 | 0.005 | 0.059 | 0.027 | 0.238 | 0.017 | 0.213 | 0.011 | 1.000 |
Missing values
Sample
| step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | isFlaggedFraud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | PAYMENT | 9839.64 | C1231006815 | 170136.00 | 160296.36 | M1979787155 | 0.0 | 0.00 | 0 | 0 |
| 1 | 1 | PAYMENT | 1864.28 | C1666544295 | 21249.00 | 19384.72 | M2044282225 | 0.0 | 0.00 | 0 | 0 |
| 2 | 1 | TRANSFER | 181.00 | C1305486145 | 181.00 | 0.00 | C553264065 | 0.0 | 0.00 | 1 | 0 |
| 3 | 1 | CASH_OUT | 181.00 | C840083671 | 181.00 | 0.00 | C38997010 | 21182.0 | 0.00 | 1 | 0 |
| 4 | 1 | PAYMENT | 11668.14 | C2048537720 | 41554.00 | 29885.86 | M1230701703 | 0.0 | 0.00 | 0 | 0 |
| 5 | 1 | PAYMENT | 7817.71 | C90045638 | 53860.00 | 46042.29 | M573487274 | 0.0 | 0.00 | 0 | 0 |
| 6 | 1 | PAYMENT | 7107.77 | C154988899 | 183195.00 | 176087.23 | M408069119 | 0.0 | 0.00 | 0 | 0 |
| 7 | 1 | PAYMENT | 7861.64 | C1912850431 | 176087.23 | 168225.59 | M633326333 | 0.0 | 0.00 | 0 | 0 |
| 8 | 1 | PAYMENT | 4024.36 | C1265012928 | 2671.00 | 0.00 | M1176932104 | 0.0 | 0.00 | 0 | 0 |
| 9 | 1 | DEBIT | 5337.77 | C712410124 | 41720.00 | 36382.23 | C195600860 | 41898.0 | 40348.79 | 0 | 0 |
| step | type | amount | nameOrig | oldbalanceOrg | newbalanceOrig | nameDest | oldbalanceDest | newbalanceDest | isFraud | isFlaggedFraud | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 6362610 | 742 | TRANSFER | 63416.99 | C778071008 | 63416.99 | 0.0 | C1812552860 | 0.00 | 0.00 | 1 | 0 |
| 6362611 | 742 | CASH_OUT | 63416.99 | C994950684 | 63416.99 | 0.0 | C1662241365 | 276433.18 | 339850.17 | 1 | 0 |
| 6362612 | 743 | TRANSFER | 1258818.82 | C1531301470 | 1258818.82 | 0.0 | C1470998563 | 0.00 | 0.00 | 1 | 0 |
| 6362613 | 743 | CASH_OUT | 1258818.82 | C1436118706 | 1258818.82 | 0.0 | C1240760502 | 503464.50 | 1762283.33 | 1 | 0 |
| 6362614 | 743 | TRANSFER | 339682.13 | C2013999242 | 339682.13 | 0.0 | C1850423904 | 0.00 | 0.00 | 1 | 0 |
| 6362615 | 743 | CASH_OUT | 339682.13 | C786484425 | 339682.13 | 0.0 | C776919290 | 0.00 | 339682.13 | 1 | 0 |
| 6362616 | 743 | TRANSFER | 6311409.28 | C1529008245 | 6311409.28 | 0.0 | C1881841831 | 0.00 | 0.00 | 1 | 0 |
| 6362617 | 743 | CASH_OUT | 6311409.28 | C1162922333 | 6311409.28 | 0.0 | C1365125890 | 68488.84 | 6379898.11 | 1 | 0 |
| 6362618 | 743 | TRANSFER | 850002.52 | C1685995037 | 850002.52 | 0.0 | C2080388513 | 0.00 | 0.00 | 1 | 0 |
| 6362619 | 743 | CASH_OUT | 850002.52 | C1280323807 | 850002.52 | 0.0 | C873221189 | 6510099.11 | 7360101.63 | 1 | 0 |